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A Continuation Ratio Model for 
Ordered Category Items 

Abstract 

Continuation ratio logits are used to model the probabilities of obtaining ordered categories 
in a polytomously scored item. This model is an alternative to other models for ordered 
category items such as the graded response model and the generalized partial credit model. 
The discussion includes a theoretical development of the model, a description of special cases, 
maximum marginal likelihood estimation of the item parameters, and estimation of ability. 
Comparisons of the item response theory models for ordered category items are presented 
using empirical data. 

Key words: item response theory, maximum marginal likelihood, multicategory logit model, 
polytomous response. 
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Introduction 



When a free response item is scored in a dichotomous fashion, for example, either correct 

or incorrect, a single stage decision is performed in a sense, that no further decisions will 

\ 

be made beyond the current decision to be taken. When a free response item is rated in a 
polytomous fashion, a multiple stage decision is required in which dependent decisions are 
made in tandem eventually. Note that the true multiple stage decision process arises only if 
a rater decides not to assign the extreme score or category to the free response in the first 
stage decision because the further decision must be made for such a situation. 

Borrowing terms from the game theory (Luce & RailTa, 1957), the particular alternative 
chosen by a rater at a given decision point is called the “choice,” and the totality of choices 
available to a rater at the decision point constitutes the “move.” A sequence of choices, 
one following another until the rating or scoring of an item is complete, can be called a 
“play” (see also Kim, 1992; Taha, 1987). The play or the rating process for a given item can 
be depicted with a connected graph consists of a collection of nodes and branches between 
pairs of nodes. Human information processing to reach a decision, or to assign a category to 
an item in the current situation, seems to require performing of multiple loops in a certain 
stage and even the repetition of the whole decision process for a given item. For the sake 
of brevity, a connected graph without any loop of branches or any reiterating move, called 
the decision tree, is used to depict the play. The single stage decision and the multiple stage 
decisions are presented in Figure 1 (i.e.. Figures 1-a to 1-d). The decision trees reflect the 
sequential nature of scoring. Each decision point is denoted as a circle and the chance events 
with respective but dependent probabilities are denoted as squares in Figure 1. Although 
other representations of the decision process is possible [e.g., see Figure 2 (i.e.. Figures 2-a 
and 2-b)], the representation in Figure 1 is assumed to facilitate the decision making process 
and to be modeled in this study. 



Insert Figure 1 and Figure 2 about here 



The decision trees for the multiple stage scoring depicted in Figure 1 involve in a set of 
dependent events. The model for the ordered choices ought to reflect the joint probabilities 
and must take into account the conditional probabilities that characterize the dependence. 
The model for ordered category items to be described is called a continuation ratio model. 
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The continuation ratio model was developed to handle polytomous response variables 
in logit models (Fienberg, 1977, 1980; Fienberg & Mason, 1978). Comparisons among the 
continuation ratio model and other models for ordered polytomous responses in logistic 
regression and loglinear modeling can be found in Agresti (1990, 1996), Clogg and Shihadeh 
(1994), and O’Connell (2000). In the item response theory field, Mellenbergh (1995) 
presented conceptual notes on models for discrete polytomous item responses and indicated 
that the continuation ratio model could be considered as a special case of the Bock’s (1972) 
model. Tutz (1990, 1997) and Verhelst, Glas, and de Vries (1997) proposed special cases 
of the continuation ratio model called the sequential item response model and the steps 
model, respectively. Some theoretical considerations of the continuation ratio model and 
other models for polytomous items were presented in Hemker, van der Ark, and Sijtsma 
( 2001 ). 



The Model 

Let the Yij is the random variable designates the rating or scored item response of individual 
i to item j. The model considered in this paper assumes that the manifestation of Yij or 
the probability of Yij to be a specific value depends on a person’s latent ability 9i and a 
vector- valued item characteristics ^j. 

The probability that yij — k given ability 6i and item parameter ^j is 

Prob {vij = k\6i, ^j) = PjkiOi) = — — — ^ ^ (1) 

JJ {1 + exp [~ajh{0i - bjh)]} 




h=l 



for k = Kj. 

Under the assumption of conditional independence, the probability of a response vector 
Vi = is given as 

j 

Prob(j/i|0i,O =p(j/i|^i,6,---,0) = Y[Pjk{0i), (3) 

j=i 
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( 4 ) 



and the joint probability of the response vectors of a sample of / subjects is given as 

Prob(y|0,O =p(yi, • •• = n n 

i=lj=l 

When the above joint probability is considered as a function of unknown parameters 
^ and 9, we call it the likelihood L. Inference of the values of unknown parameters from 
observed data can be accomplished by maximizing the likelihood or its modifications with 
respect to the unknown parameters. 

Estimation 



There are several estimation procedures to obtain parameter estimates in the continuation 
ratio model. Among those, the marginal estimation based on Bock and Aitkin (1981) can 
be considered to be the de facto standard procedure for the estimation of item parameters. 
In order to relate the continuation ratio model under item response theory and under the 
generalized linear model and in order to link other estimation methods used in psychometrics 
that involve the ability parameters, three procedures are presented; the estimation with 
known ability, the joint estimation, and the marginal estimation. Also presented are, in 
conjuction with the marginal estimation, the EM algorithm (i.e., the detailed E-step and 
M-step), model fit statistics, characterizing ability, and the information function. 

The Estimation with Known Ability 



The estimation with know ability is also known as the bioassay solution (see Mislevy & Bock, 
1984). In the estimation with known ability, the 9's are known quantities, determined by 
the experimenter. It is instructive to consider the estimation with known ability first so that 
we may note similarities and differences with the other estimation procedures. Because the 
abilities are known and grouping is employed, the subscript I is used. 

Let, iov k = 1, . . . , Kj — 1, 



Then, 




1 

1 +exp|-o,t(9| - 






1 - p;tW 



n - p;m] 

h=l 

Kj-l 

n 



if A: = 1 
if A: = Kj. 



( 5 ) 



( 6 ) 
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If we let nLi P;M = 1 and ,{9i) = 0, then, without loss of generality, 



fc-i 



pMDi) = n pkw - p;m)1 



( 7 ) 



h=l 



Also let Nji is the number of subjects with ability di to respond item j and rjki is the 
number of subjects with ability 9i with response k to item j. Note that Nji = 3'iid 

I = Then, 






L J 

^=nn 



Ki 



^ TV ; 
(=ij=i t 4 fc=i 



n 



k=l 



( 8 ) 

The right part of the likelihood can be seen as a multinomial distribution M{n, {r^}; {pk}) 
(Agresti, 1990, p. 38), characterized by the sample size n, observed counts {r^}, and cell 
probabilities {p^}, where n = Nji, and = Pjk{9i). The likelihood can be written 

as 

L J 

(9) 



L = n n M(N„, {P,i.m)- 

i=l j=l 

The maximum likelihood solution for the item parameters yields the values that maximize 
the likelihood for given data. 

It can be noted that the likelihood can be written as 

i = n ri f fT ) I' - p;m)Y‘'‘ P hm"’'-'’'' x 



1=1 j=i V 

f Nji - rjii 

V 



[1 - 



Nji - Vjii 



where 



that is. 



■ ■ ■ — r 






[1 - 



rj{Kj-i)i 



U 



1 _ p* (f)\ = 

1 - P,(»-„(ft) 



PlM) 



1 - E P,M) 

h=l 



. ( 10 ) 



( 11 ) 



1 - phm 
1 - p^m 



p,mh 

p,2m 

i-p,mr 
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Pm:-nW 

i-p,M) Pm,-^)W 

If B{n,r;p) denotes the binomial distribution with n trials, r success, and the success 
probability p, then the part of the likelihood consists of B[Nji,rjn-,l — Pji{6i)], B[Nji — 
rju,rj2i'A - Pj2i^i)], •••, B[Nji - Vju - ••• - ~ Pj(Kj-i)i^i)]^ 

consequently, 

i = n n n - e ’■m, v, i - (12) 

l=lj=l k=l h=l 

Equivalently, we may work with the log likelihood for the maximization, 

L j 

logL = ^ ^ constant + rji;log[l - P/i(6»;)] + (iVj; - r^i;) logP/i(6';) + 

;=i j=i 

constant + rj 2 i log[l - Pj 2 {di)] + (iVj; - rju - rj 2 i) \ogP*^{6i)+ 



constant + log[l - + (% “ (13) 

That is, 



L J Kj-'^ [ 

log L = constant + ^ ^ ^ 

;=1 j=i fc=l L 



rju log(l - P;M + JV„ - E log P;M) 



h=l 



( 14 ) 



This expression is maximized by differentiating with respect to the item parameters and then 
finding the values of item parameters that make the derivatives zero. These are likelihood 
equations. For example, the partial derivative with respect to aji is 

= E K - (9. -bp)- (15) 

;=i 

Note that for an item with Kj possible response or scoring categories, there exist 2{Kj — 1) 
item parameters. The partial derivatives with respect to ajk and bjk are: 



91ogL 

ddjk 



= E 



l=i 



k~l 



IV, 1 - E ’■,« - 1V„ - 5: PM) 



h=l 



h=l 



[9i — bjk) 



( 16 ) 



91ogL 

dbjk 



L 



E 



- E 



h=l 




k-\ 



E’-f- p;m) 

h=l J 



^jk 



( 17 ) 



If a vector of item parameters that solves these equations is unique and if the matrix of second 
derivatives of the log likelihood with regard to the item parameters is positive definite, then 
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the resulting values are the maximum likelihood estimates of item parameters. The second 
derivatives are obtained as: 
log I, 



= -il - PStW] (nj, - ‘i; r,,.,) (ft - 

;=i V h=i / 

= j:p;M){i - p;m)] Ui, - E ft„') (ft - 

(JO^jkOjk i=i V h=l J 

log L 






= -T. - PjM)] (n,i - E a] 

l=l \ h=l ) 



2 

'jk 



Let 



' jkl ~ ^jl 



and 



h=l 
k — l 



N'jki = Nji - 

h=l 

Then, the partial derivatives with respect to ajk and bjk can be written as: 

d log L ^ 



da 



= E ViU - I^iuPhW] (»t - bjk) 



;=i 



The second derivatives can also be written as: 






logL 



= - - pmmuoi - bjk) 



1=1 



= E pjMii - p;M)]Niti(<>‘ - 



jk^jk i=i 



= - E piM)(^ - p;,,mNiua 

jk ' 



1=1 



2 

jk 



( 18 ) 

( 19 ) 

( 20 ) 

( 21 ) 

( 22 ) 

( 23 ) 

( 24 ) 

( 25 ) 

( 26 ) 
( 27 ) 



All second derivatives for parameters from different items are zero, so the solutions are 
independent from one item to another. The solutions can be found by Newton-Raphson 
method, carried out item by item, using 






where t designate iterations. 



d^ logL 
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c(t) 



( 28 ) 



( 29 ) 



and 
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d\ogL 






( 30 ) 



For large samples of examinees, the maximum likelihood estimates follow a multivariate 
normal distribution with means equal to the true values and dispersion matrix given by the 
block diagonal matrix 

E = -diag HJ') . (31) 

The Joint Estimation 

In the psychometric setting, the values of 0’s are not known, and so the previous estimation 
with known ability cannot be applied in general. A variation called the joint estimation 
(or joint maximum likelihood estimation) has been implemented in a number of computer 
programs [e.g., LOGIST (Wingersky, Patrick, & Lord, 1999)]. In the joint estimation, the 
log likelihood is maximized, namely, 

I J 



where 



and 



I J K] 

log ^ = mz log Pjk{0i) = iz iz £ Vijk log Pjk{0i), 

i=l j=l i=l j=l k=l 



vu = I ^ = ^ 

1 0 otherwise 






Pjki^i) = n Pjk{0ir^^^. 

k=l 



(32) 



(33) 



(34) 



Maximization is carried out with respect to the item parameters as well as 0j parameters. 
The partial derivatives with respect to item parameters are obtained in a similar manner as 
in the previous estimation with known ability and not presented in detail. For example, 



dlogL 1 dPjk{0j) 

There are / additional likelihood equations for the ability parameters 

aiogL 1 



(35) 



de. 



J Ki 

= go 



a\ogp,t{e,) 

Vijk QQ 



(36) 



j=l k=l 

The last term of the equation may not be easy to solve. We can use the relationship, however, 

k—l 



iogP„{e,) = Y. ionP;M) + iog|i - PJtiDi)]- 

8 



(37) 



h=l 
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For example, if we have an item with Kj = 4, then 



log Pjk{0i) 



' log[l - P*i{di)] if A: = 1 

iogp;,{e,)] + iog[i-p;,{e,)] nk = 2 

log p*,{9,) + log p;,{9^) + iog[i-p;,{e^)] nk = 3 
. log Ph + l°g Pp + l°g Pp if A: = 4. 



(38) 



Hence, 



d log Pjk{9i) 
39, 



ajiP*y{9i) if A: = 1 

— aji[l — Pji{9i)\ + aj2Pj2(9i) if k = 2 

-a,x[l - P;,{9,)] - a,2[l - P*2m + a,3P;M) if ^ = 3 

I -a,x[l - P;,{9,)] - a,2[l - P*2m - a,3[l - P;3{9,)] if k = 4 



(39) 



and the second derivatives are 

d^iogPjtje,) 

def 

f -S-if’AW)(i - PAWl = l 

-aj.P'Mll - P;M)] - - PMm if*: = 2 

- PMm - a%PJM)ll - PhiOi)] if *: = 3 

I -alP;M)[l - Ph(0i)] - o-hP^Sim - P‘2(e.)] - <^%PiM)\l - P;M)\ if *: = 4. 



Rather than solving for all item and ability parameters simultaneously, it is customary 
to maximize the log likelihood in cycles (i.e., the Birnbaum paradigm). The maximum 
likelihood ability estimate based on the Newton-Raphson method is 







3 log 1/39, 
3^ log L 1 39‘! 



(t) 



(41) 



This joint estimation may have some problems (see Baker, 1992). The most serious one is 
that estimates of item parameters are inconsistent for tests of finite length (Andersen, 1972) 
even as the number of subjects increases without bound. 



The Marginal Estimation 

Suppose that it is reasonable to think of subjects as a random sample from a population in 
which ability is distributed in accordance with the density g{9). Rather than obtaining a 
possibly unstable point estimate of 9, given y,, we could use the Bayes theorem to compute 
the entire posterior distribution of 9 given using 

^ ^ p{yi\d)g{0) 

[ Piyi\0)9{0)d9 

Je 
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Then, the discrete distribution to be used in the estimation of item parameters could be 



formed not by placing the data for an individual all at one point estimate but by distributing 
it across the ability scale in proportion to the posterior probability. The actual steps in the 
marginal estimation are similar to those in the joint estimation. The principal difference 
is in distributing each subject’s data over the ability scale in proportion to the posterior 
probability rather than placing entire mass at one point. It is intuitively clear that this 
procedure is especially advantageous for short tests since the ability estimates will be most 
unstable, and be most wrong in assigning a single value for the unknown ability. Moreover, 
the posterior density exists for all response patterns, including those for which the maximum 
likelihood estimates are infinite. As test length increases, the joint estimation and the 
marginal estimation may become identical. The catch is that to compute p{yi\9), we have 
to know the item parameters. The marginal estimation, hence, is necessarily iterative like 
the joint estimation. 

Under the usual assumption of conditional independence, the probability of response 
vector Hi from a subject with ability 9 is given by 

pfei»,e) = ■ (43) 

j=i 

The probability of observing the response pattern from a subject selected at random from 
a population which 9 is distributed in accordance with the density function g{9) is given by 
the integral of the probability over the population 



This is the marginal probability of response pattern yi with respect to the population density 
g, conditioned upon the item parameters. The marginal probability of a sample of response 
pattern is the product of the marginal probability over subjects and can be written as 



For given data, this is the likelihood function for the item parameters and the parameters 
of the population distribution. Assuming g to be known, the item parameters are estimated 
by maximizing the log of the marginal likelihood. 



p(»ik) = J p{yi\9,()sWM 



(44) 




(45) 




( 46 ) 



This is done by taking the first derivatives with respect to the item parameters and setting 
the results to zero to produce the likelihood equations. Whether a solution is a maximum 
can be determined by examining the matrix of second derivatives of values of the likelihood 
in the neighborhood of the solution. The value of —2 times the likelihood can be obtained 
at each iteration, and the values should generally be decreasing (see Mislevy & Bock, 1984 
for discussion of advantages and disadvantages of the marginal estimation). 

The maximum marginal likelihood equations are needed. For example, the first derivative 



of ajk is 



51ogL 

5a, 



d 



jk 



E 



Li=l 

p{vi\0 dajk 
1 d 



E.logP(^il'f) 
[p(yik)] 



PiVilO 9ajk 

f ^ 

^,v(vi\e)J 

^ I 1 5a, 



^iP{yi\()J dajk 
I r( d 



[logp(yi|^,0] 



- / p{yi\d,i)g{d)d9 

ik J 

\p{yi\9,Ci\9{9)d9 
p{yi\9^C)9i9)' 



p{yi\0 



de 



(47) 



= E/ (48) 

We see that the first derivative in the marginal estimation has the form of the first derivative 
in the estimation with known ability, integrated over the posterior density of 6 given data. 

Bock and Lieberman (1970) gave a numerical solution to the likelihood equations under 
the two-parameter normal ogive model. Their solution was based on a straightforward 
application of the Newton-Raphson method. The first derivatives were similar to the formula 
above. The values of the integral which cannot generally be solved analytically were obtained 
by quadratures. Also computed was an approximation of the expected matrix of second 
derivatives. In order to obtain the partial derivative for logp(yi|0, ^), let us use an indicator 
variable, 

{ 1 if move k occurred for subject i and item j , 

0 otherwise, 

where /c = 1, . . . , {Kj — 1). If a subject i has been administered item j and assigned score k 
{k ^ Kj), then all Uijh, h = 1, . . . ,k, are unity. If assigned Kj, then all Uijh, h = 1, . . . ,Kj — l, 
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are unity. Note that 



= E / [(1 - ftjii) - PltW] {« - bjt)pmvi,^)dl) (50) 

i=l-' 

r I I 

= / - yijk)p{G\yi,0 - Pjki^)^'fJ'ijkp{d\yuO 

■' U=1 i=l 

In 1981, Bock and Aitkin reformulated the likelihood equations to produce a solution that 
avoids the computational problems of the Bock and Lieberman solution. The procedure is 
equivalent to solution by Dempster, Laird, and Rubin’s (1977) generalized EM algorithm. 
The approximation to integration was done by numerical quadratures. Integration by 
quadrature replaces the problem of finding the area under a curve by the simpler problem of 
finding sum of the areas of a finite number of rectangles which approximate the area under 
the curve. The quadrature approximation for p{yi\^) yields 



{6 - bjk)de. (51) 



p(y,m = 'tp(v,\x,.()MX,). 

q=l 



(52) 



where the points at which the function is evaluated, Ax, ... , Xq , are referred to as quadrature 
points, and associated with them are the quadrature weights A{Xg). The weights take into 
account the height of the density function g in the neighborhood of the A’s and the width 
of the rectangles. We may use the standard normal density for g (Stroud & Secrest, 1966). 
Using the quadratures, the likelihood equation for ajk is 



5 log L 
dai 



•jk 



Q 

= E 

q=l 



- yijh)p{Xq\yuO - Pjki^k)'^Uijkp{Xg\yi,^) 



. 1=1 



i=l 



(A, - bjk) (53) 



where 



and 



= E L't, - (X, - b„), 

q=l 



I 

'^jkq ~~ 0 

i=l 

I 

= Y. - yijk) IpiVilOr^ PiVil^q: OM^q) 

i—l 

I 

= Y 

i=i 

I 

^jkq ^ 0 

i=l 



yijk)p{yi\^qi 

l:%,p{yi\x„i)A{x,) 



(54) 

(55) 

(56) 

(57) 

(58) 
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( 59 ) 



i=l 

/ 

= E 

i=l 

These are the expected number of subjects and subjects assigned category higher than k to 
item j at quadrature point q, given the population density g and the item parameters. Note 
that fjkq and Njkq depend on the p{0\yi, terms, which in tern depend on the unknown item 
parameters. This means the likelihood equations are implicit, and must be solved iteratively. 
In analogy to the EM algorithm, each iteration can be broken into two steps. The E-step 
evaluates fjkq and Njkq for all j, k, and q for provisional values of the item parameters. The 
M-steps solves the likelihood equations of the item parameters with the fjkq's and Njkq's 
treated as known values. 

For a given item, with fjkq and Njkq taken as known the likelihood equations do not 
depend on the item parameters of any other item. The M-step consists of independent 
solutions to each of the J items in and of itself. Computation is very reduced from the 
Bock and Lieberman solution since each item solution can be accomplished by the Newton- 
Raphson method to find the item parameters alone, just as in the estimation with known 
ability. The Hessian produced in each item’s M-step can be used to produce approximate 
standard errors for the item parameters. These M-step standard errors tend to be too 
small, however, because they do not take into account the dependence among items in the 
full likelihood solution nor the fact that fjkq'^ and Njkq's are estimated rather than known 
quantities. The full information matrix may be empoyed in the Fisher-scoring solution to 
obtain more precise standard errors (see Mislevy & Bock, 1990). 

Note that the density function g may be approximated together with item parameters 
(see Mislevy, 1984; Mislevy Sz Bock, 1984) 

The E-Step 

The Bock and Aitkin algorithm is based on a discrete representation of g{9) and the integrand 
of p{yi\^), both continuous densities, over Q quadrature points, Xq, with q = 1,...,(5. 
Such a discrete representation of the continuous densities may be made arbitrarily close 
to continuous reality by choosing Q large, just as numerical integration may be made 
arbitrarily accurate by using sufficient quadrature points. However, large values of Q slow 
the computations (e.g., we may use Q = 10). 



'^ijkP{yi\Nq, ^)A(A^g) 
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Under the assumption that the population is composed of individuals who are members 
of Q discrete classes with values Xi, . . . ,Xq on the latent variable, complete data sufficient 
statistics for the estimation of item parameters for item j would consist of a table of counts 
{'fjkq}> ill which each element Tjkg is the number of individuals in class X^ selecting or assigned 
response greater than k on item j. So the E-step of the Bock and Aitkin algorithm consists of 
the expected values of the Vjkq, conditioned on the data and the current provisional estimates 
of the item parameters, as well as the expected value of the Nj^q, that is, the number of 
individuals in class Xq. 

Using the provisional estimates of the item parameters for each item, we can compute 
the expected values of the Vjkq as 



I 

E/ij jkq\y'>^^ — ^jkq ~ ~ yijk)p{Xq\yi, 

i=l 



( 61 ) 



= - yijk) [p(yill)] P{yi\Xq,i)A{Xq) 

i=l 

/ 

= E 

i=l 

and 

E{Nji^q\y^ = Njkq — C) 

i=l 

= [p(Kia]”‘p(Kl^„0-4W 

i=l 






( 62 ) 

( 63 ) 



( 64 ) 

( 65 ) 



'^ijkP{yi\Xq, ^)A(^Xq) 



( 66 ) 



where 



Q 

p{yi\i) = T.piyi\^‘<^i)MX‘<)- 



9=1 

Note that, while fj^q and Njkq are computed in a loop over the observed response patterns, 
the values Pjk{d) for each item are required only for a fixed set of Q values of Xq. If those 
values are placed in a table before the E-step is begun, the computations involved in fjkq and 
Njkq are limited to table look-up, multiplication, and addition. The E-step yields a set of J 
{Kj — 1 ) X Q tables of non-integral artificial counts which are used as data in the M-step. 
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The M-Step 



The M-step consists of maximum likelihood estimation of the parameters for all items 
j = 1, . . . , J, using the tables of expected values fj^q and Njkq as data. It is simply nonlinear 
regression. 

In terms of the expected data, the loglikelihood is 



log L oc 



J Kj-l Q 

E E E [(JVy*, - fjk,) iog[i - P‘t(.x,)] + . 



j=l h=l q=l 



( 67 ) 



Standard gradient methods may be used to maximize logL over the parameter space. The 
maximum likelihood estimates of the item parameter are obtained where 



dlogL 

d^jk 



0 



for all parameters in ^j. 

]B0Cctij.s0 

■ dlogL e fjt,-Njt,P;k{X,) (dPUX,Y 



S c^(^,)(i - p;k(x,)i V Si 



>jk 



we have 



and 



SPjt[X,) 

ddji^ 

spux,) 



ab 






p,ux,)ii - p;t(x,)]{x, - b„) 



= -P-t{X,)[l - PX(X,)]ajf 



The second derivatives can also be written as: 
logL 



7^ = - E PMx,)li - p;,(x,)]N,t,{x, - b,,Y 

1=1 

= E p;^x,)ii - P;,{X,)]Nj,,{X, - b„)a„ 

OO^jkOjh i^i 

logL 



Sb% 



= - E P,\(X,)ll - PMX,mr.,4 



1=1 



^jk 



( 68 ) 



( 69 ) 

( 70 ) 

( 71 ) 

( 72 ) 

( 73 ) 

( 74 ) 



Using the updated item parameters from the M-step, the sequence of the E-step and the 
M-step is repeated either the parameters stabilize or a fixed number of cycles is reached. 
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Model Fit Statistics 



The fit of the model and subject ability distribution can be compared against the general 
multinomial alternative if the following conditions are met: (1) maximum likelihood 
estimation is employed. (2) All subjects respond to all items. (3) The number of item 
is less than or equal to 10. Under these conditions, the following statistic has approximately 
a chi-square distribution in large samples of subjects: 



where summation runs over all potential response patterns yi, r{yi) is the number of 
occurrences of a given pattern, and p{yi\^) is the marginal probability. Terms for patterns 
that have not been observed are set to zero. Degrees of freedom are given by the number 
of potential response patterns minus the number of parameters estimated minus one. The 
approximation to the chi-square distribution is poor when expected counts are small. 

When strict maximum likelihood estimation is employed, likelihood ratio tests of model 
fit may also be obtained. Two types of tests are available; a comparison of nested models 
and a comparison of the model against the general multinomial alternative. The value of 
—2 times the log likelihood can be obtained after estimation cycle. For large samples of 
subjects values between the augmented model A and the more restrictive compact model C 
has approximately a chi-square distribution, that is. 



the augmented model. This test may be used to compare the fit of nested model when 
separate estimation procedures are performed to provide maximum likelihood estimates of 
item parameters under both models. 

Characterizing 6 

The parameter estimation described above usually has the goal of “calibrating” a set of test 
items, after which the item parameters are to be taken as known and used to characterize 
9 for examinees who produce a particular response pattern y^. (Note that another way to 
estimate the 9 is to use the method of maximum likelihood as we have seen in the joint 
estimation earlier.) 




(75) 



<5^ = (-21ogLc) ^ (-21ogLA), 



(76) 



where the degrees of freedom equal to the number of additional parameters estimated under 



Given a set of item parameters, the posterior density for 9 is 



p{0\yi) = ^P3k{0)g{e). 

i=i 



( 77 ) 



If the model is correct, the above equation describes the distribution of examinees who 
respond with pattern yi. For more than a few items, the posterior density is roughly 
Gaussian in shape, and so it may also be described by estimates of its location and spread. 
The procedure, called the maximum a posteriori (MAP) method, is to use the mode as an 
estimate of the location of the posterior density, where 



d\ogp{9\yi) ^ ^ d\ogPjk{9) ^ d\ogg{9) ^ ^ 



dd 



j=i 



dd 



d9 



(78) 



with the posterior variance approximated by the negative inverse of 

'd'^ log p{9\yiY 



E 



dO^ 



(79) 



The modal estimate is practical to compute as long as g{9) is a reasonable function, and 
easy if g{9) is normal. Because there is no guarantee that the posterior density is unimodal, 
potential multimodality may present problems for mechanical use of modal estimates. 

It is fairly straight forward, however, to numerically integrate the posterior distribution 
to obtain its mean and variance. The mean has been called the expected a posterior (EAP) 
estimate of 9 (see Bock & Mislevy, 1982). An advantage of the EAP procedure over modal 
estimation is that the derivative of g{9) is not required. Therefore g{9) may take any form 
describable as a histogram with finite variance. 



The Information Functioii 



The amount of information yielded by the item j at ability level is 

Kj 

k—l 



(80) 



where the quantity Ijk[9)Pjk{9) is the amount of information share of category k and iA») 
is the information function of an item response category k defined as 






dHogPjk{9) 

d9^ 



d \PjM 

d9 [Pjk{9)_ 



P'jMPjki^) - [PjkiO)? 

[PA^)V 



( 81 ) 



O 
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where = dPjk{9)/de and P-^{e) = d^Pjk{0)/dd\ Hence, 




( 82 ) 



For the continuation ratio model, 




(83) 



where rih=i Pjhi^) = 1- The information function for a test with J item is 



j 



m = Y.m- 



(84) 



An Example 



Data 

The 1998 Fall data of the Baseline version of the Georgia Kindergarten Assessment Program- 
Revised (GKAP-R) were analyzed. The Baseline version of the GKAP-R is a performance 
assessment rating instrument that consists of ten polytomously scored items with three 
ordered categories. For the purpose of the present example, four mathematics items were 
selected. 

The full description of the GKAP-R can be obtained in the Georgia Department of 
Education web site: 

http : //www .doe . kl2 .ga.us/ curriculum/testing/gkap . asp 
A total of 105,731 students who did not have any omitted or unreached responses were used. 

Parameter Estimation 

The marginal estimation was carried out on four mathematics items form the test of the 
GKAP-R. Ten quadrature fractile points were used for ability integration during calculations. 

After several cycles of the expected and maximization iterations, the item parameter 
estimates were stable to four significant figures. Goodness of fit for the model was assessed. 
The resulting chi-square value was 2067.44312. The chi-square has the degrees of freedom 
equal to the number of response patterns minus the number of parameters estimated minus 
one (i.e., 68=81-12-1). 



ERIC 



18 



20 



Although the solution shows remarkably good fit, the chi-square is large mainly due 
to the large sample size. Taken into account with the quite large sample size used in the 
calibration, the agreement of the observed and estimated frequencies of the response patterns 
seems reasonably good. 

Category Response Functions 

Category response functions of the four items under the continuation ratio model are shown in 
Figure 3. Item parameter estimates from which the functions were constructed are presented 
in Table 1. 



Insert Figure 3 and Table 1 about here 



To make comparisons, category response functions of the items under the graded response 
model (Samejima, 1969, 1972, 1997), the generalized partial credit model (Muraki, 1992, 
1997), and the partial credit model (Masters, 1982; Masters & Wright, 1997) are shown 
in Figures 4, 5, and 6, respectively. The respective sets of item parameter estimates are 
presented in Tables 2, 3, and 4. The computer program MULTILOG (Thissen, 1991) was 
used to obtain parameter estimates under the graded response model, the generalized partial 
credit model, and the partial credit model (see Childs & Chen, 1999, for the comparability 
of parameter estimates from polytomous models). 



Insert Figures 4-7 and Tables 2-4 about here 



For each of the items, the monotonic decreasing curve corresponds to the lowest category; 
the middle curve corresponds to the middle category; the monotonic increasing curve 
corresponds to the highest category. These indicate in each item that the examinees of 
indefinitely low ability will be assigned the lowest category and, conversely, that examinees 
of indefinitely high ability will be assigned the highest category. 

Some extent, the form of the category response functions depicted in Figures 3 to 6, 
although obtained from the different values of item parameters based on the different models, 
are very similar except for the first and second categories of Item 4 (see Figure 7 for overall 
comparisons). 




19 21 



Information Functions 



Item information shares of the three categories of Item 1, item information functions of the 
four items, and the test information function under the continuation ratio model are shown in 
Figure 8. The same sets of functions under the graded response model, the generalized partial 
credit model, and the partial credit model are shown in Figures 9, 10, and 11, respectively. 



Insert Figures 8-12 about here 



The overall information from the continuation ratio model can be compared with values 
obtained from the graded response model, the generalized partial credit model, and the 
partial credit model. The continuation ration model yielded a slightly higher information 
for the lower ability levels than other models (see Figure 12 for overall comparisons). 

Ability Estimates 

The expected a posteriori method was used to estimate the ability parameters. The ability 
estimates from the continuation ratio model are presented in Table 5. Table 5 also contains 
the corresponding standard deviations for the 81 response patterns. The ability estimates 
from the other models are presented in Tables 6 to 8. All ability estimates and the posterior 
standard deviations are very similar for all models. The differences in the estimates and 
the posterior standard deviations among the models occurred in mostly second and third 
decimal places. 



Insert Tables 5-8 about here 



Summary 

An item response theory model which retains the sequential order of the item response 
categories is proposed for tests consisting of ordered category items. The model makes 
use of continuation ratio logits to describe the probability of assigning each category in 
terms of two item parameters and the ability parameter. Procedures based on the method 
of maximum likelihood are described for estimating item and ability parameters, and for 
testing goodness of fit of the model. 
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Three procedures of estimation item (and ability) parameters of the model are described. 
In the first estimation, the maximum likelihood estimates of item parameters are obtained 
assuming the ability parameters are known. In the joint estimation, item and ability 
estimates are obtained jointly by maximizing the likelihood function with respect to the item 
and ability parameters. In the marginal estimation, the likelihood function is integrated with 
respect to the ability distribution in order to obtain maximum marginal likelihood estimates 
of the item parameters. For the marginal estimation, the ability parameters are to be 
estimated subsequently assuming that the item parameter estimates are known values. The 
methods of maximum likelihood, the expected a posteriori, and the maximum a posteriori 
are available for the ability estimation. 

An application of the maximum marginal likelihood method with the expected a 
posteriori estimation of ability is reported using data consisting of the responses of 105,731 
examinees to a four item mathematics test of the Baseline version of the GKAP-R. 
Comparisons of the estimates of item and ability parameters from the continuation ratio 
model, the graded response model, and the generalized partial credit model as well as the 
partial credit model show the results of the models to be closely comparable. The results of 
the test of goodness of fit of the models show also similar comparability. 

An information analysis is carried out to compare the precision of estimating ability 
under the models of polytomously scored items. The test information functions from the 
models show different patterns of precision along ability scale. For examinees below median 
ability, the continuation ratio model results in an increase in precision. For examinees above 
median ability, the continuation ratio model results in a slight decrease in precision. 

The model considered in this paper can be applied to polytomous response items that 
have special characteristics. The polytomous responses are ordered instead of nominal. The 
categories or ordered levels of the response are assigned in a sequential manner. Not all 
polytomous ordered responses have such characteristics. 
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Table 1 

Continuation Ratio (CR) Model Item Parameter Estimates of the Mathematics Baseline Section 
of the Georgia Kindergarten Assessment Program-Rivised 



CR Item Parameter Estimates 



Item 


aji 


bji 


aj2 


bj2 


Ml 


2.37 


-1.87 


1.63 


-1.77 


M2 


3.05 


-0.87 


2.04 


-0.04 


M3 


2.17 


-1.14 


1.67 


0.47 


M4 


2.03 


-2.04 


0.86 


-1.12 







Table 2 

Graded Response (GR) Model Item Parameter Estimates of the Mathematics Baseline Section 
of the Georgia Kindergarten Assessment Program-Rivised 



GR Item Parameter Estimates 


Item 


aj 


bji 


bj2 


Ml 


2.07 


-1.97 


-1.33 


M2 


2.62 


-0.90 


0.09 


M3 


1.98 


-1.17 


0.52 


M4 


1.23 


-2.68 


-0.71 





Table 3 

Generalized Partial Credit (GPC) Model Item Parameter Estimates of the Mathematics Baseline Section 

of the Georgia Kindergarten Assessment Program-Rivised 



GPC Item Parameter Estimates 



Item 


aji 


\ 


CljS 


Cji 




CjZ 


Ml 


-1.50 


o 

o 

o 


1.50 


-2.29 


-0.25 


2.54 


M2 


-2.22 


0.00 


2.22 


-1.21 


0.58 


0.64 


M3 


-1.74 


0.00 


1.74 


-1.04 


0.92 


0.12 


M4 


-1.01 


0.00 


1.01 


-1.88 


0.44 


1.45 
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Table 4 

Partial Credit (PC) Model Item Parameter Estimates of the Mathematics Baseline Section 
of the Georgia Kindergarten Assessment Program-Rivised 



PC Item Parameter Estimates 



Item 


aji 


aj2 


Oj3 


Cji 


Cj2 




Ml 


-1.54 


0.00 


1.54 


-2.33 


-0.24 


2.57 


M2 


-1.54 


0.00 


1.54 


-0.89 


0.35 


0.54 


M3 


-1.54 


0.00 


1.54 


-0.96 


0.84 


0.12 


M4 


-1.54 


0.00 


1.54 


-2.41 


0.65 


1.76 
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Table 5 

Expected A Posteriori (EAR) Ability Estimates and the Corresponding 
Posterior Standard Deviation (PSD) Using the Continuation Ratio (CR) Model for the 
Mathematics Baseline Section of the Georgia Kindergarten Assessment Program- Rivised 



Response 

Pattern 


CR Model 


Frequency 


Response 

Pattern 


CR Model 


Frequency 


EAP 


PSD 


EAP 


PSD 


1111 


-2.21 


0.56 


1971 


2223 


-0.45 


0.37 


913 


1112 


-1.71 


0.47 


1209 


2231 


-0.55 


0.38 


20 


1113 


-1.55 


0.40 


799 


2232 


-0.33 


0.42 


167 


1121 


-1.63 


0.44 


255 


2233 


-0.15 


0.49 


321 


1122 


-1.37 


0.39 


636 


2311 


-0.86 


0.50 


18 


1123 


-1.22 


0.46 


594 


2312 


-0.55 


0.40 


45 


1131 


-1.36 


0.40 


17 


2313 


-0.41 


0.41 


48 


1132 


-1.02 


0.51 


61 


2321 


-0.49 


0.38 


22 


1133 


-0.80 


0.50 


101 


2322 


-0.26 


0.45 


191 


1211 


-1.49 


0.38 


83 


2323 


-0.06 


0.52 


369 


1212 


-1.24 


0.44 


126 


2331 


-0.21 


0.48 


12 


1213 


-1.04 


0.50 


116 


2332 


0.18 


0.55 


86 


1221 


-1.15 


0.48 


48 


2333 


0.45 


0.57 


336 


1222 


-0.76 


0.47 


222 


3111 


-1.47 


0.35 


935 


1223 


-0.60 


0.41 


304 


3112 


-1.24 


0.44 


2258 


1231 


-0.74 


0.47 


6 


3113 


-1.04 


0.50 


2645 


1232 


-0.47 


0.38 


60 


3121 


-1.15 


0.48 


364 


1233 


-0.33 


0.43 


85 


3122 


-0.76 


0.48 


2423 


1311 


-1.14 


0.48 


17 


3123 


-0.58 


0.44 


3832 


1312 


-0.75 


0.48 


26 


3131 


-0.73 


0.48 


41 


1313 


-0.57 


0.43 


35 


3132 


-0.43 


0.43 


321 


1321 


-0.66 


0.44 


15 


3133 


-0.24 


0.50 


844 


1322 


-0.42 


0.39 


84 


3211 


-0.97 


0.50 


340 


1323 


-0.26 


0.46 


134 


3212 


-0.62 


0.42 


1393 


1331 


-0.38 


0.42 


5 


3213 


-0.48 


0.40 


2442 


1332 


-0.06 


0.53 


33 


3221 


-0.55 


0.39 


395 


1333 


0.19 


0.56 


101 


3222 


-0.34 


0.41 


4886 


2111 


-1.73 


0.47 


697 


3223 


-0.16 


0.49 


10392 


2112 


-1.47 


0.33 


935 


3231 


-0.29 


0.44 


81 


2113 


-1.36 


0.37 


736 


3232 


0.06 


0.54 


1533 


2121 


-1.42 


0.35 


268 


3233 


0.32 


0.56 


5519 


2122 


-1.16 


0.47 


741 


3311 


-0.53 


0.41 


200 


2123 


-0.95 


0.51 


723 


3312 


-0.26 


0.47 


703 


2131 


-1.14 


0.48 


15 


3313 


-0.04 


0.55 


1372 


2132 


-0.75 


0.48 


77 


3321 


-0.22 


0.47 


331 


2133 


-0.57 


0.43 


120 


3322 


0.17 


0.55 


4714 


2211 


-1.32 


0.39 


147 


3323 


0.44 


0.57 


14402 


2212 


-0.98 


0.50 


291 


3331 


0.26 


0.57 


155 


2213 


-0.77 


0.48 


278 


3332 


0.72 


0.61 


3846 


2221 


-0.87 


0.50 


181 


3333 


1.07 


0.68 


23221 


2222 


-0.57 


0.38 


1243 
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Table 6 

Expected A Posteriori (EAP) Ability Estimates and Their Corresponding 
Posterior Standard Deviations (PSD) Using the Graded Response (CR) Model for the 
Mathematics Baseline Section of the Georgia Kindergarten Assessment Program- Rivised 



Response 

Pattern 


GR Model 


Frequency 


Response 

Pattern 


GR Model 


Frequency 


EAP 


PSD 


EAP 


PSD 


1111 


-2.18 


0.60 


1971 


2223 


-0.51 


0.34 


913 


1112 


-1.84 


0.54 


1209 


2231 


-0.55 


0.42 


20 


1113 


-1.61 


0.50 


799 


2232 


-0.46 


0.39 


167 


1121 


-1.59 


0.49 


255 


2233 


-0.25 


0.48 


321 


1122 


-1.40 


0.46 


636 


2311 


-0.95 


0.55 


18 


1123 


-1.14 


0.53 


594 


2312 


-0.76 


0.51 


45 


1131 


-1.38 


0.56 


17 


2313 


-0.49 


0.49 


48 


1132 


-1.14 


0.56 


61 


2321 


-0.50 


0.47 


22 


1133 


-0.80 


0.56 


101 


2322 


-0.39 


0.45 


191 


1211 


-1.45 


0.47 


83 


2323 


-0.11 


0.53 


369 


1212 


-1.27 


0.49 


126 


2331 


-0.06 


0.57 


12 


1213 


-0.99 


0.53 


116 


2332 


0.06 


0.55 


86 


1221 


-0.97 


0.52 


48 


2333 


0.40 


0.53 
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-0.79 
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-1.43 


0.48 
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-0.58 
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-1.25 
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2258 
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6 


3113 
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1232 
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60 
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-0.30 
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85 
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0.55 


35 


3132 


-0.44 


0.51 


321 


1321 


-0.62 


0.53 


15 


3133 


-0.10 


0.58 
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0.54 
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1331 
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0.60 


5 


3213 


-0.47- 
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2442 


1332 


0.00 
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0.38 
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0.52 


5519 


2122 


-1.22 


0.48 


741 


3311 


-0.42 
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703 


2131 


-1.12 


0.53 


15 


3313 


0.09 


0.59 


1372 


2132 
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Table 7 

Expected A Posteriori (EAP) Ability Estimates and Their Corresponding 
Posterior Standard Deviations (PSD) Using the Generalized Partial Credit (GPC) Model for the 
Mathematics Baseline Section of the Georgia Kindergarten Assessment Program- Rivised 



Response 

Pattern 


GPC Model 


Frequency 


Response 

Pattern 


GPC Model 


Frequency 


EAP 


PSD 


EAP 


PSD 


1111 


-2.15 


0.59 


1971 


2223 


-0.47 


0.39 


913 


1112 


-1.83 


0.52 


1209 


2231 


-0.51 


0.39 


20 


1113 


-1.60 


0.44 


799 


2232 


-0.35 


0.43 


167 


1121 


-1.66 


0.46 


255 


2233 


-0.12 


0.52 


321 


1122 


-1.47 


0.42 


636 


2311 


-0.73 


0.46 


18 


1123 


-1.27 


0.47 


594 


2312 


-0.55 


0.39 


45 


1131 


-1.33 


0.45 


17 


2313 


-0.39 


0.41 


48 


1132 


-1.09 


0.51 


61 


2321 


-0.44 


0.40 


22 


1133 


-0.83 


0.49 


101 


2322 


-0.25 


0.48 


191 


1211 


-1.56 


0.43 


83 


2323 


0.02 


0.54 


369 


1212 


-1.38 


0.44 


126 


2331 


-0.06 


0.53 


12 


1213 


-1.16 


0.50 


116 


2332 


0.24 


0.55 


86 


1221 


-1.23 


0.49 


48 


2333 


0.53 


0.55 


336 


1222 


-0.97 


0.52 


222 


3111 


-1.42 


0.43 


935 


1223 


-0.72 


0.46 


304 


3112 


-1.21 


0.49 


2258 


1231 


-0.78 


0.48 


6 


3113 


-0.95 


0.52 


2645 


1232 


-0.59 


0.40 


60 


3121 


-1.03 


0.52 


364 


1233 


-0.43 


0.40 


‘ 85 


3122 


-0.77 


0.48 


2423 


1311 


-1.11 


0.51 


17 


3123 


-0.58 


0.40 


3832 


1312 


-0.84 


0.50 


26 


3131 


-0.62 


0.42 


41 


1313 


-0.63 


0.42 


35 


3132 


-0.47 


0.39 


321 


1321 


-0.68 


0.44 


15 


3133 


-0.29 


0.46 


844 


1322 


-0.51 


0.39 


84 


3211 


-0.90 


0.51 


340 


1323 


-0.35 


0.43 


134 


3212 


-0.67 


0.44 


1393 


1331 


-0.40 


0.41 


5 


3213 


^0.50 


0.39 


2442 


1332 


-0.19 


0.50 


33 


3221 


-0.55 


0.39 


395 


1333 


0.09 


0.55 


101 


3222 


-0.39 


0.41 


4886 


2111 


-1.71 


0.48 


697 


3223 


-0.18 


0.50 


10392 


2112 


-1.51 


0.42 


935 


3231 


-0.25 


0.48 


81 


2113 


-1.32 


0.46 


736 


3232 


0.02 


0.54 


1533 


2121 


-1.38 


0.44 


268 


3233 


0.32 


0.54 


5519 


2122 


-1.15 


0.50 


741 


3311 


-0.47 


0.39 


200 


2123 


-0.89 


0.51 


723 


3312 


-0.30 


0.45 


703 


2131 


-0.96 


0.52 


15 


3313 


-0.05 


0.53 


1372 


2132 


-0.72 


0.46 


77 


3321 


-0.13 


0.52 


331 


2133 


-0.54 


0.39 


120 


3322 


0.17 


0.55 


4714 


2211 


-1.28 


0.47 


147 


3323 


0.46 


0.54 


14402 


2212 


-1.03 


0.52 


291 


3331 


0.38 


0.54 


155 


2213 


-0.77 


0.48 


278 


3332 


0.69 


0.57 


3846 


2221 


-0.84 


0.50 


181 


3333 


1.08 


0.67 


23221 


2222 


-0.63 


0.42 


1243 
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Table 8 

Expected A Posteriori (EAP) Ability Estimates and Their Corresponding 
Posterior Standard Deviations (PSD) Using the Partial Credit (PC) Model for the 
Mathematics Baseline Section of the Georgia Kindergarten Assessment Program- Rivised 



Response 

Pattern 


PC Model 


Frequency 


Response 

Pattern 


PC Model 


Frequency 


EAP 


PSD 


EAP 


PSD 


1111 


-2.19 


0.58 


1971 


2223 


-0.43 


0.44 


913 


1112 


-1.74 


0.48 


1209 


2231 


-0.74 


0.49 


20 


1113 


-1.46 


0.41 


799 


2232 


-0.43 


0.44 


167 


1121 


-1.74 


0.48 


255 


2233 


-0.05 


0.55 


321 


1122 


-1.46 


0.41 


636 


2311 


-1.15 


0.50 


18 


1123 


-1.15 


0.50 


594 


2312 


-0.74 


0.49 


45 


1131 


-1.46 


0.41 


17 


2313 


-0.43 


0.44 


48 


1132 


-1.15 


0.50 


61 


2321 


-0.74 


0.49 


22 


1133 


-0.74 


0.49 


101 


2322 


-0.43 


0.44 


191 


1211 


-1.74 


0.48 


83 


2323 


-0.05 


0.55 


369 


1212 


-1.46 


0.41 


126 


2331 


-0.43 


0.44 


12 


1213 


-1.15 


0.50 


116 


2332 


-0.05 


0.55 


86 


1221 


-1.46 


0.41 


48 


2333 


0.45 


0.59 


336 


1222 


-1.15 


0.50 


222 


3111 


-1.46 


0.41 


935 


1223 


-0.74 


0.49 


304 


3112 


-1.15 


0.50 


2258 


1231 


-1.15 


0.50 


6 


3113 


-0.74 


0.49 


2645 


1232 


-0.74 


0.49 


60 


3121 


-1.15 


0.50 


364 


1233 


-0.43 


0.44 


85 


3122 


-0.74 


0.49 


2423 


1311 


-1.46 


0.41 


17 


3123 


-0.43 


0.44 


3832 


1312 


-1.15 


0.50 


26 


3131 


-0.74 


0.49 


41 


1313 


-0.74 


0.49 


35 


3132 


-0.43 


0.44 


321 


1321 


-1.15 


0.50 


15 


3133 


-0.05 


0.55 


844 


1322 


-0.74 


0.49 


84 


3211 


-1.15 


0.50 


340 


1323 


-0.43 


0.44 


134 


3212 


-0.74 


0.49 


1393 


1331 


-0.74 


0.49 


5 ■ 


3213 


-0.43 


0.44 


2442 


1332 


-0.43 


0.44 


33 


3221 


-0.74 


0.49 


395 


1333 


-0.05 


0.55 


101 


3222 


-0.43 


0.44 


4886 


2111 


-1.74 ■ 


0.48 


697 


3223 


-0.05 


0.55 


10392 


2112 


-1.46 


0.41 


935 


3231 


-0.43 


0.44 


81 


2113 


-1.15 


0.50 


736 


3232 


-0.05 


0.55 


1533 


2121 


-1.46 


0.41 


268 


3233 


0.45 


0.59 


5519 


2122 


-1.15 


0.50 


741 


3311 


-0.74 


0.49 


200 


2123 


-0.74 


0.49 


723 


3312 


-0.43 


0.44 


703 


2131 


-1.15 


0.50 


15 


3313 


-0.05 


0.55 


1372 


2132 


-0.74 


0.49 


77 


3321 


-0.43 


0.44 


331 


2133 


-0.43 


0.44 


120 


3322 


-0.05 


0.55 


4714 


2211 


-1.46 


0.41 


147 


3323 


0.45 


0.59 


14402 


2212 


-1.15 


0.50 


291 


3331 


-0.05 


0.55 


155 


2213 


-0.74 


0.49 


278 


3332 


0.45 


0.59 


3846 


2221 


-1.15 


0.50 


181 


3333 


1.06 


0.69 


23221 


2222 


-0.74 


0.49 


1243 













Figure Captions 



Figure 1-a. A decision tree for the single stage decision. 

Figure 1-b. A decision tree for the two stage sequential decision. 

Figure 1-c. A decision tree for the three stage sequential decision. 

Figure 1-d. A decision tree for the K stage sequential decision. 

Figure 2-a. A decision tree for the three stage decision. 

Figure 2-h. A decision tree for the three stage decision. 

Figure 3. Category response functions of the four items under the continuation ratio model. 

Figure 4- Category response functions of the four items under the graded response model. 

Figure 5. Category response functions of the four items under the generalized partial credit 
model. 

Figure 6. Category response functions of the four items under the partial credit model. 

Figure 7. Category response functions of the four items under the continuation ratio model, 
the graded response model, the generalized partial credit model, and the partial credit 
model. 

Figure 8. Item information shares of the three categories of Item 1, item infomation 
functions of the four items, and the test information function under the continuation 
ratio model. 

Figure 9. Item information shares of the three categories of Item 1, item infomation 
functions of the four items, and the test information function under the graded response 
model. 



Figure 10. Item information shares of the three categories of Item 1, item infomation 
functions of the four items, and the test information function under the generalized 
partial credit model. 



Figure 11. Item information shares of the three categories of Item 1, item infomation 
functions of the four items, and the test information function under the partial credit 
model. 

Figure 12. Item information shares of the three categories of Item 1, item infomation 
functions of the four items, and the test information function under the continuation 
ratio model, the graded response model, the generalized partial credit model, and the 
partical credit model. 





38 





39 





40 





42 



Probability Probability 



CR Category Response Functions of Item 1 



CR Category Response Functions of Item 2 





Ability 



Ability 



CR Category Response Functions of Item 3 



CR Category Response Functions of Item 4 





Ability Ability 



o 

ERIC 



43 



Probability Probability 



GR Category Response Functions of Item 1 



GR Category Response Functions of Item 2 





Ability 



Ability 



GR Category Response Functions of Item 3 



GR Category Response Functions of Item 4 





Ability 



Ability 



Probability Probability 



GPC Category Response Functions of Item 1 



GPC Category Response Functions of Item 2 




45 



Probability Probability 



PC Category Response Functions of Item 1 



PC Category Response Functions of Item 2 




Probability Probability 



Category Response Functions of Item 1 



Category Response Functions of Item 2 




47 



Information Information Information Information 



CR Item Information Share of Item 1 Category 1 



CR Item Information Share of Item 1 Category 2 




0 

Ability 



CR Item Information Share of Item 1 Category 3 




0 

Ability 



CR Item Information Function of Item 1 




CR Item Information Function of Item 4 



CR Test Information Function 





-2 -1 



3 




- 3 - 2-10123 



Ability 



48 



Ability 



Information Information Information Information 



GR Item Information Share of Item 1 Category 1 



GR Item Information Share of Item 1 Category 2 




GR Item Information Share of Item 1 Category 3 




-3 



0 

Ability 



GR Item Information Function of Item 1 





Ability 



Ability 



GR Item Information Function of Item 2 



GR Item Information Function of Item 3 





Information Information Information Information 



GPC Item Information Share of Item 1 Category 1 



GPC Item Information Share of Item 1 Category 2 





- 3 - 2-10 1 2 3 



- 3 - 2-10123 



Ability 



Ability 



GPC Item Information Share of Item 1 Category 3 



GPC Item Information Function of Item 1 





- 3 - 2-10123 



- 3 - 2-10123 



Ability 



Ability 



GPC Item Information Function of Item 2 



GPC Item Information Function of Item 3 




Information Information Information Information 



PC Item Information Share of Item 1 Category 1 



PC Item Information Share of Item 1 Category 2 





Ability 



Ability 



PC Item Information Share of Item 1 Category 3 



PC Item Information Function of Item 1 





Ability 



Ability 



PC Item Information Function of Item 2 



PC Item Information Function of Item 3 






CVJ - 




- 3 - 2-10123 

Ability 

PC Item Information Function of Item 4 



o 

CD 

o 

*c 

— CVJ - 




- 3 - 2-10123 

Ability 

PC Test Information Function 




Information Information Information Information 



Item Information Share of Item 1 Category 1 



Item Information Share of Item 1 Category 2 




-3 



0 

Ability 




-3 



0 

Ability 



Item Information Share of Item 1 Category 3 



Item Information Function of Item 1 





CR 

GR 

GPC 

PC 


— 1 1 1 1 1 1 1 — 

- 3 - 2-10123 

Ability 

Item Information Function of Item 2 




CR 

GR 

GPC 

PC 



- 3 - 2-10123 



Ability 

Item Information Function of Item 4 




Ability 



Item Information Function of Item 3 




Ability 



Test Information Function 




Acknowledgments 



The author would like to express his gratitude to Fumiko Samejima and Mark D. Reckase 
for their valuable comments and suggestions when the earlier draft was presented at the 
meeting. 




53 



Author’s Address 



Send correspondence to Seock-Ho Kim, Department of Educational Psychology, The Uni- 
versity of Georgia, 325 Aderhold Hall, Athens, GA 30602-7143. Internet: skim@coe.uga.edu 




U.S. Department of Education 

Office of Educational Research and Improvement (OERI) 
National Library of Education (NLE) 

Educational Resources Information Center (ERIC) 

REPRODUCTION RELEASE 

(Specific Document) 



EWC 

TIVI034852 



I. DOCUMENT IDENTIFICATION: 



Title: 



A Continuation Ratio Model for Ordered Category Items 



Author(s): Seock-Ho Kim 



Corporate Source: paper presented at the annual meeting of the 

Psychometric Society, Chapel Hill , North Carolina 



Publication Date: 
June, 2002 



II. REPRODUCTION RELEASE: 

In order to disseminate as widely as possible timely and significant materials of interest to the educational community, documents 
announced in the monthly abstract journal of the ERIC system, Resources in Education (RIE), are usually made available to users in microfiche, 
reproduced paper copy, and electronic media, and sold through the ERIC Document Reproduction Service (EDRS). Credit is given to the source 
of each document, and, if reproduction release is granted, one of the following notices is affixed to the document. 

If permission is granted to reproduce and disseminate the identified document, please CHECK ONE of the following three options and sign 
at the bottom of the page. 



Th6 S 3 mpl 6 sticksr shovvn b6low will bQ Th6 S3mpl6 slick6r shown bolow will b6 Th6 S3mpl6 stickor shown bolow will b6 

affixed to 3 ll Level 1 documents 3ffixed to ell Level 2A documents offixed to all Level 2B documents 



PERMISSION TO REPRODUCE AND 
DISSEMINATE THIS MATERIAL HAS 
BEEN GRANTED BY 




PERMISSION TO REPRODUCE AND 
DISSEMINATE THIS MATERIAL IN 
MICROFICHE. AND IN ELECTRONIC MEDIA 
FOR ERIC COLLECTION SUBSCRIBERS ONLY. 
HAS BEEN GRANTED BY 




PERMISSION TO REPRODUCE AND 
DISSEMINATE THIS MATERIAL IN 
MICROFICHE ONLY HAS BEEN GRANTED BY 










. . . 






.. 








TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC) 




TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC) 




TO THE EDUCATIONAL RESOURCES 
INFORMATION CENTER (ERIC) 


1 




2A 




2B 


Level 1 


Level 2A 

□ 


Level 2B 

□ 


Check here for Level 1 releose, permitting 
reproduction 3nd disseminotion in microfiche or 
other ERIC erchivel medio (e.g., electronic) and 


Check here for Level 2A release, permitting reproduction 
and dissemination in microfiche and in electronic media for 
ERIC archival collection subscribers only 


Check here for Level 2B release, permitting reproduction 
and dissemination in microfiche only 



p3per copy. 



Documents will be processed 3S indicated provided reproduction qu3lity permits. 

If permission to reproduce is gronted, but no box is checked, documents will be processed 3t Level 1. 



/ hereby grant to the Educational Resources Information Center (ERIC) nonexclusive permission to reproduce and disseminate this document 
as indicated above. Reproduction from the ERIC microfiche or electronic media by persons other than ERIC employees and its system 
contractors requires permission from copyright holder. Exception is made for non-profit reproduction by libraries and other service agencies 
to satisfv information needs of educators in responss_to discrete inquiries. 


issor 

f240 


JlsurdL4\ 




Printed Name/Position/Title: 

Seock-Ho Kim, Associate Profi 


Organization/Address: University of Georgia 

325 Aderhold Hall, Athens, GA 30602 


T®'epho"®706-542-4224 


706-542- 


E-Mail Address: 

skim@coe . uea.edu 


Date: 

4/15/03 



o 




r 



I 



III. DOCUMENT AVAILABILITY INFORMATION (FROM NON-ERIC SOURCE): 

If permission to reproduce is not granted to ERIC, or, if you wish ERIC to cite the availability of the document from another source, 
please provide the following information regarding the availability of the document. (ERIC will not announce a document unless it Is 
publicly available, and a dependable source can be specified. Contributors should also be aware that ERIC selection criteria are 
significantly more stringent for documents that cannot be made available through EDRS.) 




IV. REFERRAL OF ERIC TO COPYRIGHT/REPRODUCTION RIGHTS HOLDER: 



If the right to grant this reproduction release is held by someone other than the addressee, please provide the appropriate name and 
address: 




V. WHERE TO SEND THIS FORM: 



Send this form to the following ERIC Clearinghouse: 



However, if solicited by the ERIC Facility, or if making an unsolicited contribution to ERIC, return this form (and the document being 
contributed) to: 



ERIC Processing and Reference Facility 
4483-A Forbes Boulevard 
Lanham, Maryland 20706 



Telephone: 
Toll Free: 
FAX: 
e-mail: 
WWW: 



301-552-4200 

800-799-3742 

301-552-4700 

ericfac@inet.ed.gov 

http://ericfacility.org 



EFF-088 (Rev. 2/2001) 



er|c 



